PyDigger - unearthing stuff about Python


NameVersionSummarydate
contextgem 0.14.0 Effortless LLM extraction from documents 2025-08-02 21:57:12
docstrange 1.0.9 Extract and Convert PDF, Word, PowerPoint, Excel, images, URLs into multiple formats (Markdown, JSON, CSV, HTML) with intelligent content extraction and advanced OCR. 2025-08-01 15:43:50
document-data-extractor 1.0.4 Best open-source document to markdown extractor for LLM training data. Convert PDF, Word, PowerPoint, Excel, images, URLs to clean markdown, JSON, HTML locally. Alternative to Unstructured, Docling, Marker, MarkItDown, MinerU, PaddleOCR, Tesseract 2025-07-29 08:25:56
llm-data-converter 2.2.0 Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPoint, Excel, images, URLs to clean markdown, JSON, HTML locally. Alternative to Unstructured, Docling, Marker, MarkItDown, MinerU, PaddleOCR, Tesseract 2025-07-25 13:32:07
tikara 0.1.5 The metadata and text content extractor for almost every file type. 2025-01-26 23:33:40
hourdayweektotal
38147410516305910
Elapsed time: 2.27769s